Paving the Way to a Large-scale Pseudosense-annotated Dataset

نویسندگان

  • Mohammad Taher Pilehvar
  • Roberto Navigli
چکیده

In this paper we propose a new approach to the generation of pseudowords, i.e., artificial words which model real polysemous words. Our approach simultaneously addresses the two important issues that hamper the generation of large pseudosense-annotated datasets: semantic awareness and coverage. We evaluate these pseudowords from three different perspectives showing that they can be used as reliable substitutes for their real counterparts.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evaluation of Updating Methods in Building Blocks Dataset

With the increasing use of spatial data in daily life, the production of this data from diverse information sources with different precision and scales has grown widely. Generating new data requires a great deal of time and money. Therefore, one solution is to reduce costs is to update the old data at different scales using new data (produced on a similar scale). One approach to updating data i...

متن کامل

A Large-Scale Pseudoword-Based Evaluation Framework for State-of-the-Art Word Sense Disambiguation

The evaluation of several tasks in lexical semantics is often limited by the lack of large numbers of manual annotations, not only for training purposes, but also for testing purposes. Word Sense Disambiguation (WSD) is a case in point, as hand-labeled data sets are particularly hard and time-consuming to create. Consequently, evaluations tend to be performed on a small scale, which does not al...

متن کامل

High performance of the support vector machine in classifying hyperspectral data using a limited dataset

To prospect mineral deposits at regional scale, recognition and classification of hydrothermal alteration zones using remote sensing data is a popular strategy. Due to the large number of spectral bands, classification of the hyperspectral data may be negatively affected by the Hughes phenomenon. A practical way to handle the Hughes problem is preparing a lot of training samples until the size ...

متن کامل

Personalized Immunology in Cancer: Paving the Way Towards a Better Quality of Life

Conventionally, in specific diseases, patients receive similar therapies; relying on a "one size fits all" approach. The discovery of P5 (Predictive, Preventive, Personalized, Participatory, and psycho-cognitive) improves personalized diagnosis, treatment, and prognosis. Considering the high prevalence, mortality rate, and complexity of cancers, there is a critical necessity to choose specific ...

متن کامل

Personalized Immunology in Cancer: Paving the Way Towards a Better Quality of Life

Conventionally, in specific diseases, patients receive similar therapies; relying on a "one size fits all" approach. The discovery of P5 (Predictive, Preventive, Personalized, Participatory, and psycho-cognitive) improves personalized diagnosis, treatment, and prognosis. Considering the high prevalence, mortality rate, and complexity of cancers, there is a critical necessity to choose specific ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013